Deep Poetry: Word-Level and Character-Level Language Models for Shakespearean Sonnet Generation

نویسندگان

  • Stanley Xie
  • Ruchir Rastogi
چکیده

Text generation is a foundational task in natural language processing, forming the core of a diverse set of practical applications ranging from image captioning and text summarization to question answering. However, most of this work has focused on generating prose. We investigate whether deep learning systems can be used to synthesize poetry, in particular Shakespearean-styled works. Previous work on generating Shakespeare prose involved training models exclusively on the word or character level. Here, we implement those previous models for poetry generation and show that models that combine word and character level information, such as a Gated LSTM and a CNN-based LSTM, significantly outperform the baseline word-LSTM and char-LSTM models. Perplexity scores for the two complex models are almost 10 fold better than that for our baselines, and human ratings of the model-generated sonnets reflect this as well. In particular, the sonnets our complex models generate have a coherent meaning and relatively correct meter without blatantly copying Shakespeare’s original works. These results encourage us that models that blend word and character level information would be useful for a variety of tasks outside of just poetry generation and may be crucial in bridging the gap between computer generated and human written text.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Make Deep Learning Great Again: Character-Level RNN Speech Generation in the Style of Donald Trump

A character-level recurrent neural network (RNN) is a statistical language model capable of producing text that superficially resembles a training corpus. The efficacy of these networks in mimicking Shakespeare, Linux source code, and other forms of text have already been demonstrated. In this paper, we show that character-level RNNs are capable of very believably mimicking the language of Pres...

متن کامل

DeepStance at SemEval-2016 Task 6: Detecting Stance in Tweets Using Character and Word-Level CNNs

This paper describes our approach for the Detecting Stance in Tweets task (SemEval-2016 Task 6). We utilized recent advances in short text categorization using deep learning to create word-level and character-level models. The choice between word-level and characterlevel models in each particular case was informed through validation performance. Our final system is a combination of classifiers ...

متن کامل

Adversarial Objectives for Text Generation

Language models can be used to generate text by iteratively sampling words conditioned on previously sampled words. In this work, we explore adversarial objectives to obtain good text generations by training a recurrent language model to keep its hidden state statistics during sampling similar to what it has sees during maximum likelihood estimation (MLE) training. We analyze the convergence of...

متن کامل

Multi-level Representations for Fine-Grained Typing of Knowledge Base Entities

Entities are essential elements of natural language. In this paper, we present methods for learning multi-level representations of entities on three complementary levels: character (character patterns in entity names extracted, e.g., by neural networks), word (embeddings of words in entity names) and entity (entity embeddings). We investigate state-of-theart learning methods on each level and f...

متن کامل

Language Identification of Bengali-English Code-Mixed data using Character&Phonetic based LSTM Models

Language identification of social media text still remains a challenging task due to properties like code-mixing and inconsistent phonetic transliterations. In this paper, we present a supervised learning approach for language identification at the word level of low resource BengaliEnglish code-mixed data taken from social media. We employ two methods of word encoding, namely character based an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017